List of AI News about compact language models
| Time | Details |
|---|---|
| 2025-10-24 15:35 | How Nanochat d32 Gains New AI Capabilities: SpellingBee Synthetic Task and SFT/RL Finetuning Explained<br>According to @karpathy, the nanochat d32 language model was recently taught to count occurrences of a letter in a word, such as the letter 'r' in 'strawberry', using a new synthetic task called SpellingBee (source: github.com/karpathy/nanochat/discussions/164). The process involved generating diverse user queries paired with ideal assistant responses, then applying supervised fine-tuning (SFT) and reinforcement learning (RL) to instill the capability in the model. Special attention was given to details that matter most for small models: the diversity of user prompts, careful handling of tokenization, and breaking the reasoning into smaller steps. The guide demonstrates how practical skills can be added incrementally to lightweight LLMs, and points to custom task training as a way to expand the capabilities of compact models quickly (source: @karpathy on Twitter). |
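
The data-generation step described above (varied user queries paired with ideal, step-by-step assistant responses) can be illustrated with a minimal sketch. This is not the nanochat implementation: the prompt templates, the response format, and the names `TEMPLATES`, `make_example`, and `make_dataset` are all hypothetical, chosen only to show the general shape of such a synthetic task.

```python
import random
import string

# Hypothetical prompt templates (an assumption; the source does not list the
# actual ones). Varying the phrasing is meant to mimic prompt diversity.
TEMPLATES = [
    "How many {letter} are in {word}?",
    "Count the letter '{letter}' in the word '{word}'.",
    "how many times does {letter} appear in {word}",
]


def make_example(word: str, letter: str, rng: random.Random) -> dict:
    """Build one (user, assistant) pair for a letter-counting query."""
    user = rng.choice(TEMPLATES).format(letter=letter, word=word)
    # Spell the word out one character per line so the count is built up
    # step by step rather than read off the word as a whole.
    steps = []
    count = 0
    for i, ch in enumerate(word, start=1):
        if ch == letter:
            count += 1
            steps.append(f"{i}. {ch} (match, count={count})")
        else:
            steps.append(f"{i}. {ch}")
    assistant = f"Spelling out '{word}':\n" + "\n".join(steps) + f"\nAnswer: {count}"
    return {"user": user, "assistant": assistant, "answer": count}


def make_dataset(words: list, n: int, seed: int = 0) -> list:
    """Generate n examples with a seeded RNG so runs are reproducible."""
    rng = random.Random(seed)
    examples = []
    for _ in range(n):
        word = rng.choice(words)
        # Half the time pick a letter from the word, otherwise any lowercase
        # letter, so that zero-count answers can also appear.
        if rng.random() < 0.5:
            letter = rng.choice(word)
        else:
            letter = rng.choice(string.ascii_lowercase)
        examples.append(make_example(word, letter, rng))
    return examples


if __name__ == "__main__":
    example = make_example("strawberry", "r", random.Random(0))
    print(example["user"])
    print(example["assistant"])
```

In a sketch like this, the `user` and `assistant` fields would serve as SFT pairs, and the stored `answer` could in principle be compared against a model's final answer to compute a correctness reward during RL. Both uses are assumptions about how such data might be consumed, not a description of the guide's actual pipeline.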